Overview

Dataset statistics

Number of variables: 16
Number of observations: 48895
Missing cells: 20141
Missing cells (%): 2.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 23.5 MiB
Average record size in memory: 503.0 B
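
The percentage and size figures above follow from the raw counts. As a quick sanity check, the sketch below recomputes them from the reported numbers. The variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 16
n_observations = 48895
missing_cells = 20141
avg_record_bytes = 503.0

# Share of missing cells among all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate total memory in MiB, derived from the average record size.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approx. size in memory: {total_mib:.1f} MiB")
```

The results agree with the reported 2.6% and 23.5 MiB after rounding.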

Variable types

Numeric: 10
Categorical: 6

Alerts

name has a high cardinality: 47905 distinct values (High cardinality)
host_name has a high cardinality: 11452 distinct values (High cardinality)
neighbourhood has a high cardinality: 221 distinct values (High cardinality)
last_review has a high cardinality: 1764 distinct values (High cardinality)
id is highly correlated with host_id (High correlation)
host_id is highly correlated with id (High correlation)
number_of_reviews is highly correlated with reviews_per_month (High correlation)
reviews_per_month is highly correlated with number_of_reviews (High correlation)
neighbourhood_group is highly correlated with latitude and 1 other field (High correlation)
latitude is highly correlated with neighbourhood_group and 1 other field (High correlation)
longitude is highly correlated with neighbourhood_group and 1 other field (High correlation)
last_review has 10052 (20.6%) missing values (Missing)
reviews_per_month has 10052 (20.6%) missing values (Missing)
minimum_nights is highly skewed (γ1 = 21.82727453) (Skewed)
name is uniformly distributed (Uniform)
id has unique values (Unique)
number_of_reviews has 10052 (20.6%) zeros (Zeros)
availability_365 has 17533 (35.9%) zeros (Zeros)
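
Three of the alerts share the same count: last_review and reviews_per_month are each missing in 10052 rows, and number_of_reviews is zero in 10052 rows. This suggests, but does not prove, that the missing values belong to listings that have never been reviewed. A minimal sketch of how one might test that on the raw rows follows. The sample rows and the helper name `missing_only_when_unreviewed` are hypothetical.

```python
def missing_only_when_unreviewed(rows):
    """Return True if each review field is missing exactly when number_of_reviews is 0."""
    for row in rows:
        unreviewed = row["number_of_reviews"] == 0
        for field in ("last_review", "reviews_per_month"):
            if (row[field] is None) != unreviewed:
                return False
    return True

# Hypothetical sample rows, not taken from the dataset.
sample = [
    {"number_of_reviews": 9, "last_review": "2018-10-19", "reviews_per_month": 0.21},
    {"number_of_reviews": 0, "last_review": None, "reviews_per_month": None},
    {"number_of_reviews": 45, "last_review": "2019-05-21", "reviews_per_month": 0.38},
]

print(missing_only_when_unreviewed(sample))
```

If the check holds on the full data, filling reviews_per_month with 0 for unreviewed listings would be a defensible choice. If it fails, the missing values need a closer look.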

Reproduction

Analysis started: 2022-02-22 06:49:52.252690
Analysis finished: 2022-02-22 06:50:14.578188
Duration: 22.33 seconds
Software version: pandas-profiling v3.1.0
Download configuration: config.json
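
The reported duration is the difference between the two timestamps. A short check using only the standard library, with the timestamps copied from above:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2022-02-22 06:49:52.252690", FMT)
finished = datetime.strptime("2022-02-22 06:50:14.578188", FMT)

# Elapsed wall-clock time between start and finish of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```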

Variables

id
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 48895
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19017143.24
Minimum: 2539
Maximum: 36487245
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 2539
5-th percentile: 1222382.7
Q1: 9471945
Median: 19677284
Q3: 29152178.5
95-th percentile: 35259101.2
Maximum: 36487245
Range: 36484706
Interquartile range (IQR): 19680233.5

Descriptive statistics

Standard deviation: 10983108.39
Coefficient of variation (CV): 0.5775372383
Kurtosis: -1.227748342
Mean: 19017143.24
Median Absolute Deviation (MAD): 9908242
Skewness: -0.09025737546
Sum: 9.298432185 × 10^11
Variance: 1.206286698 × 10^14
Monotonicity: Strictly increasing
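
Several of the statistics above are simple functions of the others. The sketch below recomputes the range, interquartile range, coefficient of variation and variance from the reported values, as a way to read the table rather than as the report's own code.

```python
# Values copied from the id statistics above.
minimum, maximum = 2539, 36487245
q1, q3 = 9471945, 29152178.5
mean, std = 19017143.24, 10983108.39

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation

print(value_range)
print(iqr)
print(round(cv, 4))
print(f"{variance:.3e}")
```

The same relationships hold for every numeric variable in the report.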
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
2539 | 1 | < 0.1%
25583366 | 1 | < 0.1%
25551687 | 1 | < 0.1%
25552076 | 1 | < 0.1%
25554120 | 1 | < 0.1%
25568873 | 1 | < 0.1%
25571627 | 1 | < 0.1%
25572892 | 1 | < 0.1%
25580113 | 1 | < 0.1%
25580283 | 1 | < 0.1%
Other values (48885) | 48885 | > 99.9%

Value | Count | Frequency (%)
2539 | 1 | < 0.1%
2595 | 1 | < 0.1%
3647 | 1 | < 0.1%
3831 | 1 | < 0.1%
5022 | 1 | < 0.1%
5099 | 1 | < 0.1%
5121 | 1 | < 0.1%
5178 | 1 | < 0.1%
5203 | 1 | < 0.1%
5238 | 1 | < 0.1%

Value | Count | Frequency (%)
36487245 | 1 | < 0.1%
36485609 | 1 | < 0.1%
36485431 | 1 | < 0.1%
36485057 | 1 | < 0.1%
36484665 | 1 | < 0.1%
36484363 | 1 | < 0.1%
36484087 | 1 | < 0.1%
36483152 | 1 | < 0.1%
36483010 | 1 | < 0.1%
36482809 | 1 | < 0.1%

name
Categorical

HIGH CARDINALITY
UNIFORM

Distinct: 47905
Distinct (%): 98.0%
Missing: 16
Missing (%): < 0.1%
Memory size: 4.4 MiB

Hillside Hotel: 18
Home away from home: 17
New york Multi-unit building: 16
Brooklyn Apartment: 12
Private Room: 11
Other values (47900): 48805

Length

Max length: 179
Median length: 37
Mean length: 36.91114794
Min length: 1

Characters and Unicode

Total characters: 185
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47260
Unique (%): 96.7%

Sample

1st row: Clean & quiet apt home by the park
2nd row: Skylit Midtown Castle
3rd row: THE VILLAGE OF HARLEM....NEW YORK !
4th row: Cozy Entire Floor of Brownstone
5th row: Entire Apt: Spacious Studio/Loft by central park

Common Values

Value | Count | Frequency (%)
Hillside Hotel | 18 | < 0.1%
Home away from home | 17 | < 0.1%
New york Multi-unit building | 16 | < 0.1%
Brooklyn Apartment | 12 | < 0.1%
Private Room | 11 | < 0.1%
Loft Suite @ The Box House Hotel | 11 | < 0.1%
Private room | 10 | < 0.1%
Artsy Private BR in Fort Greene Cumberland | 10 | < 0.1%
Beautiful Brooklyn Brownstone | 8 | < 0.1%
Private room in Brooklyn | 8 | < 0.1%
Other values (47895) | 48758 | 99.7%
(Missing) | 16 | < 0.1%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
in | 16752 | 5.6%
room | 10038 | 3.4%
(not shown) | 8430 | 2.8%
bedroom | 7601 | 2.5%
private | 7158 | 2.4%
apartment | 6695 | 2.2%
cozy | 4991 | 1.7%
apt | 4618 | 1.5%
brooklyn | 4049 | 1.4%
studio | 3988 | 1.3%
Other values (12552) | 224301 | 75.1%
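
The lowercase tokens in the table above look like a word-frequency count over the listing names. The exact tokenization used by the report is not stated. The rough sketch below assumes lowercasing and splitting on whitespace, and uses the five names shown under Sample as input.

```python
from collections import Counter

def word_frequencies(names):
    """Count lowercase, whitespace-separated tokens across all names."""
    counts = Counter()
    for name in names:
        if name is None:  # skip missing names
            continue
        counts.update(name.lower().split())
    return counts

names = [
    "Clean & quiet apt home by the park",
    "Skylit Midtown Castle",
    "THE VILLAGE OF HARLEM....NEW YORK !",
    "Cozy Entire Floor of Brownstone",
    "Entire Apt: Spacious Studio/Loft by central park",
]

for word, count in word_frequencies(names).most_common(3):
    print(word, count)
```

Under this assumption punctuation stays attached to its token (for example `apt:`), so a real analysis would likely want a stricter tokenizer.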

Most occurring characters

Value | Count | Frequency (%)
(not shown) | 185 | 100.0%

Most occurring categories

Value | Count | Frequency (%)
Control | 185 | 100.0%

Most frequent character per category

Control
Value | Count | Frequency (%)
(not shown) | 185 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 185 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
(not shown) | 185 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 185 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(not shown) | 185 | 100.0%

host_id
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 37457
Distinct (%): 76.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 67620010.65
Minimum: 2438
Maximum: 274321313
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 2438
5-th percentile: 815564.1
Q1: 7822033
Median: 30793816
Q3: 107434423
95-th percentile: 241764600.2
Maximum: 274321313
Range: 274318875
Interquartile range (IQR): 99612390

Descriptive statistics

Standard deviation: 78610967.03
Coefficient of variation (CV): 1.162539998
Kurtosis: 0.1691057556
Mean: 67620010.65
Median Absolute Deviation (MAD): 27543913
Skewness: 1.206213924
Sum: 3.306280421 × 10^12
Variance: 6.179684138 × 10^15
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
219517861 | 327 | 0.7%
107434423 | 232 | 0.5%
30283594 | 121 | 0.2%
137358866 | 103 | 0.2%
16098958 | 96 | 0.2%
12243051 | 96 | 0.2%
61391963 | 91 | 0.2%
22541573 | 87 | 0.2%
200380610 | 65 | 0.1%
7503643 | 52 | 0.1%
Other values (37447) | 47625 | 97.4%

Value | Count | Frequency (%)
2438 | 1 | < 0.1%
2571 | 1 | < 0.1%
2787 | 6 | < 0.1%
2845 | 2 | < 0.1%
2868 | 1 | < 0.1%
2881 | 2 | < 0.1%
3151 | 1 | < 0.1%
3211 | 1 | < 0.1%
3415 | 1 | < 0.1%
3563 | 1 | < 0.1%

Value | Count | Frequency (%)
274321313 | 1 | < 0.1%
274311461 | 1 | < 0.1%
274307600 | 1 | < 0.1%
274298453 | 1 | < 0.1%
274273284 | 1 | < 0.1%
274225617 | 1 | < 0.1%
274195458 | 1 | < 0.1%
274188386 | 1 | < 0.1%
274103383 | 1 | < 0.1%
274079964 | 1 | < 0.1%

host_name
Categorical

HIGH CARDINALITY

Distinct: 11452
Distinct (%): 23.4%
Missing: 21
Missing (%): < 0.1%
Memory size: 3.0 MiB

Michael: 417
David: 403
Sonder (NYC): 327
John: 294
Alex: 279
Other values (11447): 47154

Length

Max length: 35
Median length: 6
Mean length: 6.12487212
Min length: 1

Characters and Unicode

Total characters: 0
Distinct characters: 0
Distinct categories: 0
Distinct scripts: 0
Distinct blocks: 0
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6903
Unique (%): 14.1%

Sample

1st row: John
2nd row: Jennifer
3rd row: Elisabeth
4th row: LisaRoxanne
5th row: Laura

Common Values

Value | Count | Frequency (%)
Michael | 417 | 0.9%
David | 403 | 0.8%
Sonder (NYC) | 327 | 0.7%
John | 294 | 0.6%
Alex | 279 | 0.6%
Blueground | 232 | 0.5%
Sarah | 227 | 0.5%
Daniel | 226 | 0.5%
Jessica | 205 | 0.4%
Maria | 204 | 0.4%
Other values (11442) | 46060 | 94.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
(not shown) | 1120 | 2.1%
and | 625 | 1.1%
michael | 460 | 0.8%
david | 449 | 0.8%
sonder | 423 | 0.8%
nyc | 338 | 0.6%
john | 337 | 0.6%
alex | 330 | 0.6%
laura | 293 | 0.5%
maria | 244 | 0.4%
Other values (10259) | 49968 | 91.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

neighbourhood_group
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 MiB

Manhattan: 21661
Brooklyn: 20104
Queens: 5666
Bronx: 1091
Staten Island: 373

Length

Max length: 13
Median length: 8
Mean length: 8.182452193
Min length: 5

Characters and Unicode

Total characters: 0
Distinct characters: 0
Distinct categories: 0
Distinct scripts: 0
Distinct blocks: 0
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Brooklyn
2nd row: Manhattan
3rd row: Manhattan
4th row: Brooklyn
5th row: Manhattan

Common Values

Value | Count | Frequency (%)
Manhattan | 21661 | 44.3%
Brooklyn | 20104 | 41.1%
Queens | 5666 | 11.6%
Bronx | 1091 | 2.2%
Staten Island | 373 | 0.8%

Length

Histogram of lengths of the category

Pie chart

Value | Count | Frequency (%)
manhattan | 21661 | 44.0%
brooklyn | 20104 | 40.8%
queens | 5666 | 11.5%
bronx | 1091 | 2.2%
staten | 373 | 0.8%
island | 373 | 0.8%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

neighbourhood
Categorical

HIGH CARDINALITY

Distinct: 221
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 MiB

Williamsburg: 3920
Bedford-Stuyvesant: 3714
Harlem: 2658
Bushwick: 2465
Upper West Side: 1971
Other values (216): 34167

Length

Max length: 26
Median length: 12
Mean length: 11.89479497
Min length: 4

Characters and Unicode

Total characters: 0
Distinct characters: 0
Distinct categories: 0
Distinct scripts: 0
Distinct blocks: 0
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6
Unique (%): < 0.1%

Sample

1st row: Kensington
2nd row: Midtown
3rd row: Harlem
4th row: Clinton Hill
5th row: East Harlem

Common Values

Value | Count | Frequency (%)
Williamsburg | 3920 | 8.0%
Bedford-Stuyvesant | 3714 | 7.6%
Harlem | 2658 | 5.4%
Bushwick | 2465 | 5.0%
Upper West Side | 1971 | 4.0%
Hell's Kitchen | 1958 | 4.0%
East Village | 1853 | 3.8%
Upper East Side | 1798 | 3.7%
Crown Heights | 1564 | 3.2%
Midtown | 1545 | 3.2%
Other values (211) | 25449 | 52.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
east | 6592 | 8.3%
side | 4680 | 5.9%
williamsburg | 3920 | 5.0%
harlem | 3775 | 4.8%
upper | 3769 | 4.8%
bedford-stuyvesant | 3714 | 4.7%
heights | 3586 | 4.5%
village | 3164 | 4.0%
west | 2759 | 3.5%
bushwick | 2465 | 3.1%
Other values (233) | 40681 | 51.4%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

latitude
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 19048
Distinct (%): 39.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 40.72894888
Minimum: 40.49979
Maximum: 40.91306
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 40.49979
5-th percentile: 40.646114
Q1: 40.6901
Median: 40.72307
Q3: 40.763115
95-th percentile: 40.825643
Maximum: 40.91306
Range: 0.41327
Interquartile range (IQR): 0.073015

Descriptive statistics

Standard deviation: 0.05453007806
Coefficient of variation (CV): 0.001338853065
Kurtosis: 0.1488446574
Mean: 40.72894888
Median Absolute Deviation (MAD): 0.03642
Skewness: 0.2371665585
Sum: 1991441.956
Variance: 0.002973529413
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
40.71813 | 18 | < 0.1%
40.68444 | 13 | < 0.1%
40.69414 | 13 | < 0.1%
40.68634 | 13 | < 0.1%
40.76125 | 12 | < 0.1%
40.68537 | 12 | < 0.1%
40.71171 | 12 | < 0.1%
40.71353 | 12 | < 0.1%
40.76189 | 12 | < 0.1%
40.68683 | 11 | < 0.1%
Other values (19038) | 48767 | 99.7%

Value | Count | Frequency (%)
40.49979 | 1 | < 0.1%
40.50641 | 1 | < 0.1%
40.50708 | 1 | < 0.1%
40.50868 | 1 | < 0.1%
40.50873 | 1 | < 0.1%
40.50943 | 1 | < 0.1%
40.51133 | 1 | < 0.1%
40.52211 | 1 | < 0.1%
40.52293 | 1 | < 0.1%
40.527 | 1 | < 0.1%

Value | Count | Frequency (%)
40.91306 | 1 | < 0.1%
40.91234 | 1 | < 0.1%
40.91169 | 1 | < 0.1%
40.91167 | 1 | < 0.1%
40.90804 | 1 | < 0.1%
40.90734 | 1 | < 0.1%
40.90527 | 1 | < 0.1%
40.90484 | 1 | < 0.1%
40.90406 | 1 | < 0.1%
40.90391 | 1 | < 0.1%

longitude
Real number (ℝ)

HIGH CORRELATION

Distinct: 14718
Distinct (%): 30.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -73.95216961
Minimum: -74.24442
Maximum: -73.71299
Zeros: 0
Zeros (%): 0.0%
Negative: 48895
Negative (%): 100.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: -74.24442
5-th percentile: -74.00388
Q1: -73.98307
Median: -73.95568
Q3: -73.936275
95-th percentile: -73.865771
Maximum: -73.71299
Range: 0.53143
Interquartile range (IQR): 0.046795

Descriptive statistics

Standard deviation: 0.04615673611
Coefficient of variation (CV): -0.0006241430961
Kurtosis: 5.021646112
Mean: -73.95216961
Median Absolute Deviation (MAD): 0.02485
Skewness: 1.284210209
Sum: -3615891.333
Variance: 0.002130444288
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
-73.95677 | 18 | < 0.1%
-73.95427 | 18 | < 0.1%
-73.95405 | 17 | < 0.1%
-73.9506 | 16 | < 0.1%
-73.94791 | 16 | < 0.1%
-73.95332 | 16 | < 0.1%
-73.95136 | 16 | < 0.1%
-73.95669 | 15 | < 0.1%
-73.95742 | 15 | < 0.1%
-73.94537 | 15 | < 0.1%
Other values (14708) | 48733 | 99.7%

Value | Count | Frequency (%)
-74.24442 | 1 | < 0.1%
-74.24285 | 1 | < 0.1%
-74.24084 | 1 | < 0.1%
-74.23986 | 1 | < 0.1%
-74.23914 | 1 | < 0.1%
-74.23803 | 1 | < 0.1%
-74.23059 | 1 | < 0.1%
-74.21238 | 1 | < 0.1%
-74.21017 | 1 | < 0.1%
-74.20941 | 1 | < 0.1%

Value | Count | Frequency (%)
-73.71299 | 1 | < 0.1%
-73.7169 | 1 | < 0.1%
-73.71795 | 1 | < 0.1%
-73.71829 | 1 | < 0.1%
-73.71928 | 1 | < 0.1%
-73.72173 | 1 | < 0.1%
-73.72179 | 1 | < 0.1%
-73.72247 | 1 | < 0.1%
-73.72435 | 1 | < 0.1%
-73.72581 | 1 | < 0.1%

room_type
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 MiB

Entire home/apt: 25409
Private room: 22326
Shared room: 1160

Length

Max length: 15
Median length: 15
Mean length: 13.53526945
Min length: 11

Characters and Unicode

Total characters: 0
Distinct characters: 0
Distinct categories: 0
Distinct scripts: 0
Distinct blocks: 0
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Private room
2nd row: Entire home/apt
3rd row: Private room
4th row: Entire home/apt
5th row: Entire home/apt

Common Values

Value | Count | Frequency (%)
Entire home/apt | 25409 | 52.0%
Private room | 22326 | 45.7%
Shared room | 1160 | 2.4%

Length

Histogram of lengths of the category

Pie chart

Value | Count | Frequency (%)
entire | 25409 | 26.0%
home/apt | 25409 | 26.0%
room | 23486 | 24.0%
private | 22326 | 22.8%
shared | 1160 | 1.2%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

price
Real number (ℝ≥0)

Distinct: 674
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 152.7206872
Minimum: 0
Maximum: 10000
Zeros: 11
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 40
Q1: 69
Median: 106
Q3: 175
95-th percentile: 355
Maximum: 10000
Range: 10000
Interquartile range (IQR): 106

Descriptive statistics

Standard deviation: 240.1541697
Coefficient of variation (CV): 1.572505822
Kurtosis: 585.6728789
Mean: 152.7206872
Median Absolute Deviation (MAD): 46
Skewness: 19.118939
Sum: 7467278
Variance: 57674.02525
Monotonicity: Not monotonic
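
The gap between the standard deviation (about 240) and the median absolute deviation (46) reflects how strongly a few very high prices inflate the former. Below is a small sketch of the MAD as the median of absolute deviations from the median, using made-up prices. The report's own implementation may differ in details such as scaling.

```python
from statistics import median, pstdev

def median_absolute_deviation(values):
    """Median of the absolute deviations from the median (no scaling factor)."""
    center = median(values)
    return median(abs(v - center) for v in values)

# Hypothetical prices with one extreme listing.
prices = [40, 69, 106, 175, 10000]

print(median_absolute_deviation(prices))
print(round(pstdev(prices), 1))
```

The single extreme value barely moves the MAD but dominates the standard deviation, which is why the MAD is the more robust spread measure for this column.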
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
100 | 2051 | 4.2%
150 | 2047 | 4.2%
50 | 1534 | 3.1%
60 | 1458 | 3.0%
200 | 1401 | 2.9%
75 | 1370 | 2.8%
80 | 1272 | 2.6%
65 | 1190 | 2.4%
70 | 1170 | 2.4%
120 | 1130 | 2.3%
Other values (664) | 34272 | 70.1%

Value | Count | Frequency (%)
0 | 11 | < 0.1%
10 | 17 | < 0.1%
11 | 3 | < 0.1%
12 | 4 | < 0.1%
13 | 1 | < 0.1%
15 | 6 | < 0.1%
16 | 6 | < 0.1%
18 | 2 | < 0.1%
19 | 4 | < 0.1%
20 | 33 | 0.1%

Value | Count | Frequency (%)
10000 | 3 | < 0.1%
9999 | 3 | < 0.1%
8500 | 1 | < 0.1%
8000 | 1 | < 0.1%
7703 | 1 | < 0.1%
7500 | 2 | < 0.1%
6800 | 1 | < 0.1%
6500 | 3 | < 0.1%
6419 | 1 | < 0.1%
6000 | 2 | < 0.1%

minimum_nights
Real number (ℝ≥0)

SKEWED

Distinct: 109
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.029962164
Minimum: 1
Maximum: 1250
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 3
Q3: 5
95-th percentile: 30
Maximum: 1250
Range: 1249
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 20.51054953
Coefficient of variation (CV): 2.917590316
Kurtosis: 854.0716624
Mean: 7.029962164
Median Absolute Deviation (MAD): 2
Skewness: 21.82727453
Sum: 343730
Variance: 420.6826422
Monotonicity: Not monotonic
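
The skewness of about 21.83 is what triggers the Skewed alert: most listings require only a few nights, while a handful require hundreds. Below is a sketch of the plain moment-based skewness, g1 = m3 / m2^(3/2), on made-up data. The report's γ1 may apply a small-sample bias correction, so this is illustrative rather than a reproduction.

```python
def moment_skewness(values):
    """Fisher-Pearson coefficient g1 = m3 / m2**1.5 (population moments, no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical minimum-night values with one extreme listing.
nights = [1, 1, 2, 3, 5, 30, 365]
symmetric = [1, 2, 3, 4, 5]

print(round(moment_skewness(nights), 2))
print(moment_skewness(symmetric))
```

A symmetric sample gives zero, while a long right tail gives a clearly positive value. For a column like this one, a log transform or capping at a sensible maximum is a common next step.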
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 12720 | 26.0%
2 | 11696 | 23.9%
3 | 7999 | 16.4%
30 | 3760 | 7.7%
4 | 3303 | 6.8%
5 | 3034 | 6.2%
7 | 2058 | 4.2%
6 | 752 | 1.5%
14 | 562 | 1.1%
10 | 483 | 1.0%
Other values (99) | 2528 | 5.2%

Value | Count | Frequency (%)
1 | 12720 | 26.0%
2 | 11696 | 23.9%
3 | 7999 | 16.4%
4 | 3303 | 6.8%
5 | 3034 | 6.2%
6 | 752 | 1.5%
7 | 2058 | 4.2%
8 | 130 | 0.3%
9 | 80 | 0.2%
10 | 483 | 1.0%

Value | Count | Frequency (%)
1250 | 1 | < 0.1%
1000 | 1 | < 0.1%
999 | 3 | < 0.1%
500 | 5 | < 0.1%
480 | 1 | < 0.1%
400 | 1 | < 0.1%
370 | 1 | < 0.1%
366 | 1 | < 0.1%
365 | 29 | 0.1%
364 | 1 | < 0.1%

number_of_reviews
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 394
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23.27446569
Minimum: 0
Maximum: 629
Zeros: 10052
Zeros (%): 20.6%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 5
Q3: 24
95-th percentile: 114
Maximum: 629
Range: 629
Interquartile range (IQR): 23

Descriptive statistics

Standard deviation: 44.55058227
Coefficient of variation (CV): 1.91413985
Kurtosis: 19.52978807
Mean: 23.27446569
Median Absolute Deviation (MAD): 5
Skewness: 3.690634572
Sum: 1138005
Variance: 1984.75438
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 10052 | 20.6%
1 | 5244 | 10.7%
2 | 3465 | 7.1%
3 | 2520 | 5.2%
4 | 1994 | 4.1%
5 | 1618 | 3.3%
6 | 1357 | 2.8%
7 | 1179 | 2.4%
8 | 1127 | 2.3%
9 | 964 | 2.0%
Other values (384) | 19375 | 39.6%

Value | Count | Frequency (%)
0 | 10052 | 20.6%
1 | 5244 | 10.7%
2 | 3465 | 7.1%
3 | 2520 | 5.2%
4 | 1994 | 4.1%
5 | 1618 | 3.3%
6 | 1357 | 2.8%
7 | 1179 | 2.4%
8 | 1127 | 2.3%
9 | 964 | 2.0%

Value | Count | Frequency (%)
629 | 1 | < 0.1%
607 | 1 | < 0.1%
597 | 1 | < 0.1%
594 | 1 | < 0.1%
576 | 1 | < 0.1%
543 | 1 | < 0.1%
540 | 1 | < 0.1%
510 | 1 | < 0.1%
488 | 1 | < 0.1%
480 | 1 | < 0.1%

last_review
Categorical

HIGH CARDINALITY
MISSING

Distinct: 1764
Distinct (%): 4.5%
Missing: 10052
Missing (%): 20.6%
Memory size: 2.8 MiB

2019-06-23: 1413
2019-07-01: 1359
2019-06-30: 1341
2019-06-24: 875
2019-07-07: 718
Other values (1759): 33137

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 0
Distinct characters: 0
Distinct categories: 0
Distinct scripts: 0
Distinct blocks: 0
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 236
Unique (%): 0.6%

Sample

1st row: 2018-10-19
2nd row: 2019-05-21
3rd row: 2019-07-05
4th row: 2018-11-19
5th row: 2019-06-22

Common Values

Value | Count | Frequency (%)
2019-06-23 | 1413 | 2.9%
2019-07-01 | 1359 | 2.8%
2019-06-30 | 1341 | 2.7%
2019-06-24 | 875 | 1.8%
2019-07-07 | 718 | 1.5%
2019-07-02 | 658 | 1.3%
2019-06-22 | 655 | 1.3%
2019-06-16 | 601 | 1.2%
2019-07-05 | 580 | 1.2%
2019-07-06 | 565 | 1.2%
Other values (1754) | 30078 | 61.5%
(Missing) | 10052 | 20.6%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
2019-06-23 | 1413 | 3.6%
2019-07-01 | 1359 | 3.5%
2019-06-30 | 1341 | 3.5%
2019-06-24 | 875 | 2.3%
2019-07-07 | 718 | 1.8%
2019-07-02 | 658 | 1.7%
2019-06-22 | 655 | 1.7%
2019-06-16 | 601 | 1.5%
2019-07-05 | 580 | 1.5%
2019-07-06 | 565 | 1.5%
Other values (1754) | 30078 | 77.4%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

reviews_per_month
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct: 937
Distinct (%): 2.4%
Missing: 10052
Missing (%): 20.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.37322143
Minimum: 0.01
Maximum: 58.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 0.01
5-th percentile: 0.04
Q1: 0.19
Median: 0.72
Q3: 2.02
95-th percentile: 4.64
Maximum: 58.5
Range: 58.49
Interquartile range (IQR): 1.83

Descriptive statistics

Standard deviation: 1.680441995
Coefficient of variation (CV): 1.223722525
Kurtosis: 42.49346948
Mean: 1.37322143
Median Absolute Deviation (MAD): 0.62
Skewness: 3.130188536
Sum: 53340.04
Variance: 2.823885299
Monotonicity: Not monotonic
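
Because 10052 values are missing, the summary statistics here describe only the remaining rows. The sketch below recovers the non-missing count from the totals above and shows that Distinct (%) appears to be taken relative to that count rather than to all 48895 observations. This is an inference from the reported figures, not a documented rule.

```python
n_observations = 48895
n_missing = 10052
n_distinct = 937
total, mean = 53340.04, 1.37322143   # Sum and Mean from the statistics above

n_present = n_observations - n_missing
implied_count = total / mean          # Sum / Mean should equal the non-missing count

print(n_present)
print(round(implied_count))
print(f"{100 * n_distinct / n_present:.1f}%")       # relative to non-missing rows
print(f"{100 * n_distinct / n_observations:.1f}%")  # relative to all rows
```

Only the first percentage matches the reported 2.4%, so percentages for columns with missing values should be read against the non-missing rows.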
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.02 | 919 | 1.9%
1 | 893 | 1.8%
0.05 | 893 | 1.8%
0.03 | 804 | 1.6%
0.16 | 667 | 1.4%
0.04 | 655 | 1.3%
0.08 | 596 | 1.2%
0.09 | 593 | 1.2%
0.06 | 579 | 1.2%
0.11 | 539 | 1.1%
Other values (927) | 31705 | 64.8%
(Missing) | 10052 | 20.6%

Value | Count | Frequency (%)
0.01 | 42 | 0.1%
0.02 | 919 | 1.9%
0.03 | 804 | 1.6%
0.04 | 655 | 1.3%
0.05 | 893 | 1.8%
0.06 | 579 | 1.2%
0.07 | 466 | 1.0%
0.08 | 596 | 1.2%
0.09 | 593 | 1.2%
0.1 | 457 | 0.9%

Value | Count | Frequency (%)
58.5 | 1 | < 0.1%
27.95 | 1 | < 0.1%
20.94 | 1 | < 0.1%
19.75 | 1 | < 0.1%
17.82 | 1 | < 0.1%
16.81 | 1 | < 0.1%
16.22 | 1 | < 0.1%
16.03 | 1 | < 0.1%
15.78 | 1 | < 0.1%
15.32 | 1 | < 0.1%

calculated_host_listings_count
Real number (ℝ≥0)

Distinct: 47
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.143982002
Minimum: 1
Maximum: 327
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 15
Maximum: 327
Range: 326
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 32.95251885
Coefficient of variation (CV): 4.612626241
Kurtosis: 67.5508883
Mean: 7.143982002
Median Absolute Deviation (MAD): 0
Skewness: 7.9331739
Sum: 349305
Variance: 1085.868499
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=47)
Value | Count | Frequency (%)
1 | 32303 | 66.1%
2 | 6658 | 13.6%
3 | 2853 | 5.8%
4 | 1440 | 2.9%
5 | 845 | 1.7%
6 | 570 | 1.2%
8 | 416 | 0.9%
7 | 399 | 0.8%
327 | 327 | 0.7%
9 | 234 | 0.5%
Other values (37) | 2850 | 5.8%

Value | Count | Frequency (%)
1 | 32303 | 66.1%
2 | 6658 | 13.6%
3 | 2853 | 5.8%
4 | 1440 | 2.9%
5 | 845 | 1.7%
6 | 570 | 1.2%
7 | 399 | 0.8%
8 | 416 | 0.9%
9 | 234 | 0.5%
10 | 210 | 0.4%

Value | Count | Frequency (%)
327 | 327 | 0.7%
232 | 232 | 0.5%
121 | 121 | 0.2%
103 | 103 | 0.2%
96 | 192 | 0.4%
91 | 91 | 0.2%
87 | 87 | 0.2%
65 | 65 | 0.1%
52 | 104 | 0.2%
50 | 50 | 0.1%
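
In the tables above, each row's count is a multiple of its value (for example, value 96 occurs 192 times and value 52 occurs 104 times). That pattern is consistent with the column holding, for every listing, the number of listings owned by the same host, so a host with 96 listings contributes 96 rows. Below is a sketch of that derivation, assuming this interpretation and using made-up host ids.

```python
from collections import Counter

def listings_per_host(host_ids):
    """For each listing, return how many listings share its host_id."""
    per_host = Counter(host_ids)
    return [per_host[h] for h in host_ids]

# Hypothetical host ids: host 7 has three listings, host 9 has one, host 4 has two.
host_ids = [7, 9, 7, 4, 7, 4]
counts = listings_per_host(host_ids)
print(counts)

# Frequency of each value; every count is a multiple of its value.
value_counts = Counter(counts)
print(sorted(value_counts.items()))
```

If the interpretation holds, the column is fully determined by host_id and carries no independent information beyond it.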

availability_365
Real number (ℝ≥0)

ZEROS

Distinct: 366
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.7813273
Minimum: 0
Maximum: 365
Zeros: 17533
Zeros (%): 35.9%
Negative: 0
Negative (%): 0.0%
Memory size: 382.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 45
Q3: 227
95-th percentile: 359
Maximum: 365
Range: 365
Interquartile range (IQR): 227

Descriptive statistics

Standard deviation: 131.6222889
Coefficient of variation (CV): 1.167057455
Kurtosis: -0.9975340452
Mean: 112.7813273
Median Absolute Deviation (MAD): 45
Skewness: 0.7634075771
Sum: 5514443
Variance: 17324.42692
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 17533 | 35.9%
365 | 1295 | 2.6%
364 | 491 | 1.0%
1 | 408 | 0.8%
89 | 361 | 0.7%
5 | 340 | 0.7%
3 | 306 | 0.6%
179 | 301 | 0.6%
90 | 290 | 0.6%
2 | 270 | 0.6%
Other values (356) | 27300 | 55.8%

Value | Count | Frequency (%)
0 | 17533 | 35.9%
1 | 408 | 0.8%
2 | 270 | 0.6%
3 | 306 | 0.6%
4 | 233 | 0.5%
5 | 340 | 0.7%
6 | 245 | 0.5%
7 | 219 | 0.4%
8 | 233 | 0.5%
9 | 193 | 0.4%

Value | Count | Frequency (%)
365 | 1295 | 2.6%
364 | 491 | 1.0%
363 | 239 | 0.5%
362 | 166 | 0.3%
361 | 111 | 0.2%
360 | 102 | 0.2%
359 | 135 | 0.3%
358 | 180 | 0.4%
357 | 95 | 0.2%
356 | 78 | 0.2%

Interactions

[Pairwise interaction scatter plots of the numeric variables; images not preserved.]
2022-02-22T12:20:13.278510image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:19:58.903867image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:00.524844image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:01.963821image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:03.619402image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:05.192062image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:06.821240image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:08.451119image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:10.016770image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-22T12:20:11.659235image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
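
The calculation described above can be sketched in plain Python as follows. This is a minimal illustration, not the code the report itself runs: the helper names `average_ranks`, `pearson` and `spearman_rho` are hypothetical, the toy data is made up, and tied values are assumed to share the average of their rank positions.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print("rho:", spearman_rho(x, y))
print("r:  ", pearson(x, y))
```

On this toy input ρ is 1 up to floating-point rounding, while Pearson's r stays below 1, which is the sense in which ρ catches nonlinear monotonic relationships.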

Pearson's r

Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exact linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
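
As a hedged illustration of this formula and of the invariance property mentioned above, the sketch below computes r by hand. The function name `pearson_r` and the sample values are assumptions made for the example; they are not taken from the dataset or from the profiling library.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    # The 1/n factors of the covariance and of both deviations cancel out.
    return cov / (sx * sy)


# Made-up, roughly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r_original = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# A negative scale factor flips the sign of r but not its magnitude.
r_flipped = pearson_r(x, [-b for b in y])

print(r_original, r_rescaled, r_flipped)
```

The first two values agree up to rounding, and the third is their negative.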

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
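
The pair-counting definition above corresponds to the variant usually called τ-a. A minimal sketch follows, assuming a hypothetical helper `kendall_tau_a` and made-up data; statistical libraries often report the tie-adjusted τ-b instead, which can differ from this value when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described in the text."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it counts toward neither.
    return (concordant - discordant) / len(pairs)


# Made-up data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.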

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
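
One way to write the bias correction is sketched below. It follows the commonly cited form of Bergsma's correction (shrinking both φ² and the table dimensions by terms of order 1/(n-1)); whether the report's implementation matches it line for line is an assumption, and the function name `cramers_v_corrected` and the toy categories are hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two equally long sequences of categories."""
    n = len(a)
    joint = Counter(zip(a, b))
    row_totals = Counter(a)
    col_totals = Counter(b)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for row, row_count in row_totals.items():
        for col, col_count in col_totals.items():
            expected = row_count * col_count / n
            observed = joint.get((row, col), 0)
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Corrected phi^2 and corrected table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    if denom <= 0:
        return 0.0  # degenerate table (a single category on one side)
    return sqrt(phi2_corr / denom)


# Made-up categories: b is fully determined by a in the first call,
# and independent of a in the second.
a = ["x", "y"] * 50
print(cramers_v_corrected(a, ["p" if v == "x" else "q" for v in a]))
print(cramers_v_corrected(["x", "x", "y", "y"] * 25, ["p", "q", "p", "q"] * 25))
```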

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
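
The nullity correlation shown in the heatmap can be sketched as the Pearson correlation between presence indicators (1 where a value is present, 0 where it is missing). The example below is a simplified illustration under that assumption: `nullity_correlation` is a hypothetical helper, missing values are represented as `None`, and the three short columns reuse the values of the first four sample rows of this dataset.

```python
from math import sqrt
from statistics import mean


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    ma, mb = mean(a), mean(b)
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    var_a = sum((p - ma) ** 2 for p in a)
    var_b = sum((q - mb) ** 2 for q in b)
    if var_a == 0 or var_b == 0:
        # A column that is always present (or always missing) has no
        # variation in nullity, so the correlation is undefined.
        return None
    return cov / sqrt(var_a * var_b)


last_review = ["2018-10-19", "2019-05-21", None, "2019-07-05"]
reviews_per_month = [0.21, 0.38, None, 4.64]
price = [149, 225, 150, 89]

print(nullity_correlation(last_review, reviews_per_month))
print(nullity_correlation(last_review, price))
```

The first pair is missing in exactly the same rows, so its nullity correlation is 1; `price` is never missing, so no correlation is defined for it.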

Sample

First rows

| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
| 0 | 2539 | Clean & quiet apt home by the park | 2787 | John | Brooklyn | Kensington | 40.64749 | -73.97237 | Private room | 149 | 1 | 9 | 2018-10-19 | 0.21 | 6 | 365 |
| 1 | 2595 | Skylit Midtown Castle | 2845 | Jennifer | Manhattan | Midtown | 40.75362 | -73.98377 | Entire home/apt | 225 | 1 | 45 | 2019-05-21 | 0.38 | 2 | 355 |
| 2 | 3647 | THE VILLAGE OF HARLEM....NEW YORK ! | 4632 | Elisabeth | Manhattan | Harlem | 40.80902 | -73.94190 | Private room | 150 | 3 | 0 | NaN | NaN | 1 | 365 |
| 3 | 3831 | Cozy Entire Floor of Brownstone | 4869 | LisaRoxanne | Brooklyn | Clinton Hill | 40.68514 | -73.95976 | Entire home/apt | 89 | 1 | 270 | 2019-07-05 | 4.64 | 1 | 194 |
| 4 | 5022 | Entire Apt: Spacious Studio/Loft by central park | 7192 | Laura | Manhattan | East Harlem | 40.79851 | -73.94399 | Entire home/apt | 80 | 10 | 9 | 2018-11-19 | 0.10 | 1 | 0 |
| 5 | 5099 | Large Cozy 1 BR Apartment In Midtown East | 7322 | Chris | Manhattan | Murray Hill | 40.74767 | -73.97500 | Entire home/apt | 200 | 3 | 74 | 2019-06-22 | 0.59 | 1 | 129 |
| 6 | 5121 | BlissArtsSpace! | 7356 | Garon | Brooklyn | Bedford-Stuyvesant | 40.68688 | -73.95596 | Private room | 60 | 45 | 49 | 2017-10-05 | 0.40 | 1 | 0 |
| 7 | 5178 | Large Furnished Room Near B'way | 8967 | Shunichi | Manhattan | Hell's Kitchen | 40.76489 | -73.98493 | Private room | 79 | 2 | 430 | 2019-06-24 | 3.47 | 1 | 220 |
| 8 | 5203 | Cozy Clean Guest Room - Family Apt | 7490 | MaryEllen | Manhattan | Upper West Side | 40.80178 | -73.96723 | Private room | 79 | 2 | 118 | 2017-07-21 | 0.99 | 1 | 0 |
| 9 | 5238 | Cute & Cozy Lower East Side 1 bdrm | 7549 | Ben | Manhattan | Chinatown | 40.71344 | -73.99037 | Entire home/apt | 150 | 1 | 160 | 2019-06-09 | 1.33 | 4 | 188 |

Last rows

| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
| 48885 | 36482809 | Stunning Bedroom NYC! Walking to Central Park!! | 131529729 | Kendall | Manhattan | East Harlem | 40.79633 | -73.93605 | Private room | 75 | 2 | 0 | NaN | NaN | 2 | 353 |
| 48886 | 36483010 | Comfy 1 Bedroom in Midtown East | 274311461 | Scott | Manhattan | Midtown | 40.75561 | -73.96723 | Entire home/apt | 200 | 6 | 0 | NaN | NaN | 1 | 176 |
| 48887 | 36483152 | Garden Jewel Apartment in Williamsburg New York | 208514239 | Melki | Brooklyn | Williamsburg | 40.71232 | -73.94220 | Entire home/apt | 170 | 1 | 0 | NaN | NaN | 3 | 365 |
| 48888 | 36484087 | Spacious Room w/ Private Rooftop, Central location | 274321313 | Kat | Manhattan | Hell's Kitchen | 40.76392 | -73.99183 | Private room | 125 | 4 | 0 | NaN | NaN | 1 | 31 |
| 48889 | 36484363 | QUIT PRIVATE HOUSE | 107716952 | Michael | Queens | Jamaica | 40.69137 | -73.80844 | Private room | 65 | 1 | 0 | NaN | NaN | 2 | 163 |
| 48890 | 36484665 | Charming one bedroom - newly renovated rowhouse | 8232441 | Sabrina | Brooklyn | Bedford-Stuyvesant | 40.67853 | -73.94995 | Private room | 70 | 2 | 0 | NaN | NaN | 2 | 9 |
| 48891 | 36485057 | Affordable room in Bushwick/East Williamsburg | 6570630 | Marisol | Brooklyn | Bushwick | 40.70184 | -73.93317 | Private room | 40 | 4 | 0 | NaN | NaN | 2 | 36 |
| 48892 | 36485431 | Sunny Studio at Historical Neighborhood | 23492952 | Ilgar & Aysel | Manhattan | Harlem | 40.81475 | -73.94867 | Entire home/apt | 115 | 10 | 0 | NaN | NaN | 1 | 27 |
| 48893 | 36485609 | 43rd St. Time Square-cozy single bed | 30985759 | Taz | Manhattan | Hell's Kitchen | 40.75751 | -73.99112 | Shared room | 55 | 1 | 0 | NaN | NaN | 6 | 2 |
| 48894 | 36487245 | Trendy duplex in the very heart of Hell's Kitchen | 68119814 | Christophe | Manhattan | Hell's Kitchen | 40.76404 | -73.98933 | Private room | 90 | 7 | 0 | NaN | NaN | 1 | 23 |